Analytics at Meta · Forecasting@Meta: Balancing Art and Science · By Steven Gin and Kevin Birnbaum · 10 min read · 1 day ago
Mikkel Dengsøe · High-impact data governance teams · How to drive impact, where to focus, and what skills are required to succeed in the best data governance teams · 14 min read · 2 days ago
Vu Trinh in Data Engineer Things · How Twitter processes 4 billion events in real-time daily · From Lambda to Kappa · 6 min read · May 25, 2024
Gabe Araujo, M.Sc. · How to Build End-to-End Data Pipelines with Python · Starting with Extraction · 9 min read · 5 hours ago
Alexandre Magno Lima Martins · How we orchestrate 2000+ DBT models in Apache Airflow · In recent years, DBT (Data Build Tool) has established itself as the go-to data transformation workflow, connecting to a variety of… · 13 min read · 6 days ago
Barr Moses in Towards Data Science · The Past, Present, and Future of Data Quality Management: Understanding Testing, Monitoring, and… · The data estate is evolving, and data quality management needs to evolve with it. · 9 min read · May 25, 2024
Vu Trinh · Everything you need to know about MapReduce · All the key insights from the paper MapReduce: Simplified Data Processing on Large Clusters from Google · 10 min read · 12 hours ago
Dunith Danushka in Towards Data Science · Real-Time Analytics Solution for Usage-Based API Billing and Metering · Design a real-time analytics pipeline for tracking API invocation usage with Apache APISIX, Redpanda, and Apache Pinot. · 11 min read · May 24, 2024